Grapheme-to-Phoneme Conversion for Korean Point-of-Interest Data Using Prosodic Words

Authors

  • Sunhee Kim
  • Je Hun Jeon
Abstract

Point-of-Interest (POI) data are location-enriched word data consisting of personal names, place names including administrative districts, and names of business and recreational locations. They are essential to the development of applications such as telematics, wireless location-based services (LBS), tourism, and other business applications. A Grapheme-to-Phoneme (G2P) conversion system, a major component of Text-to-Speech (TTS) and automatic speech recognition (ASR) systems, maps orthography to its corresponding sequence of phonemes. When POI data are converted into pronunciations, two characteristics of the data need to be considered. First, they are proper names, which are known to be closely associated with irregular pronunciations in Korean. Second, most POI data are compound or complex words consisting of more than two words, and a constituent of one POI word also occurs in many other words. It is therefore a prerequisite to separate each word into its constituents in order to derive its pronunciation, including any irregular pronunciations. In this paper, we propose a G2P conversion system that detects irregular pronunciations in Korean POI data by adopting the notion of the prosodic word from Prosodic Phonology (Selkirk 1984). The performance of the proposed system depends on the effective detection of words with irregular pronunciation. In the proposed system, all the words in the POI data were broken down into prosodic words, which were trained by category (regular, irregular, or referential). The trained words were then used to create a dictionary for each category. With this system, irregular pronunciation words could be detected more efficiently, using far fewer words than other methods.
To detect words of irregular pronunciation, the proposed system required manual review of only 22.2% of the input data on average, compared with 87% for the exceptions detection approach that does not use the prosodic word as a unit, and 100% for the regular data-driven approach. The results of our experiment showed that 11.43% of 300k POI data were detected as words of irregular pronunciation. Compared with the 6.67% obtained in a text corpus, POI data consisting of proper names contain a much higher percentage of irregular pronunciations than an ordinary text corpus. By combining phonology with computational modeling, the proposed system not only provides enhanced performance but also, assuming that phonology is a subsystem of the mind/brain (Durand & Laks 2002) or a branch of the study of mental representation, the psychology of mind (Hale & Reiss 2000), reflects the same method by which a speaker derives the pronunciation of morphologically complex Korean proper names.
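
The abstract does not say how POI words are segmented or how the category dictionaries are consulted, so the following is only a minimal sketch under stated assumptions. It assumes a greedy longest-match segmentation over a single lookup table that maps each known unit to one of the three categories named above. The table contents are placeholders rather than Korean data, and the names `CATEGORY_DICT`, `segment`, and `needs_review` are hypothetical.

```python
from typing import Dict, List, Optional

# Hypothetical lookup table: unit -> category. The entries are placeholders,
# not real Korean prosodic words or pronunciations.
CATEGORY_DICT: Dict[str, str] = {
    "alpha": "regular",
    "beta": "irregular",
    "gamma": "referential",
}


def segment(poi: str, lexicon: Dict[str, str]) -> Optional[List[str]]:
    """Split a POI string into known units by greedy longest match.

    Returns None when some part of the string is not covered by the lexicon.
    This matching strategy is an assumption, not the paper's stated method.
    """
    units: List[str] = []
    i = 0
    while i < len(poi):
        # Try the longest candidate starting at position i first.
        for j in range(len(poi), i, -1):
            if poi[i:j] in lexicon:
                units.append(poi[i:j])
                i = j
                break
        else:
            return None  # no known unit starts at position i
    return units


def needs_review(poi: str, lexicon: Dict[str, str]) -> bool:
    """Flag a POI entry if it cannot be segmented or contains an irregular unit."""
    units = segment(poi, lexicon)
    if units is None:
        return True
    return any(lexicon[u] == "irregular" for u in units)


if __name__ == "__main__":
    for entry in ["alphagamma", "alphabeta", "alphadelta"]:
        print(entry, segment(entry, CATEGORY_DICT), needs_review(entry, CATEGORY_DICT))
```

In this sketch only entries that contain an irregular or unknown unit are flagged, which illustrates why working at the level of shared constituents can reduce the share of entries needing manual review.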

Similar resources

A Korean grapheme-to-phoneme conversion system using selection procedure for exceptions

Cultural, social, economic, and various other environmental factors affect our language, and different words and terminology are used and coined for different contexts, which triggers quantitative change in the vocabulary of a language. This paper presents a Korean grapheme-to-phoneme conversion system using a selection procedure for exceptions from an added text corpus, which reflects such dynamic nat...

Rule-based Korean Grapheme to Phoneme Conversion Using Sound Patterns

Grapheme-to-phoneme conversion plays an important role in text-to-speech applications and other fields of computational linguistics. Although Korean uses a phonemic writing system, it requires grapheme-to-phoneme conversion for speech synthesis because the Korean writing system does not always reflect actual pronunciation. This paper describes a grapheme-to-phoneme conversion method based o...

Solving the Phoneme Conflict in Grapheme-to-Phoneme Conversion Using a Two-Stage Neural Network-Based Approach

To achieve high quality output speech synthesis systems, data-driven grapheme-to-phoneme (G2P) conversion is usually used to generate the phonetic transcription of out-of-vocabulary (OOV) words. To improve the performance of G2P conversion, this paper deals with the problem of conflicting phonemes, where an input grapheme can, in the same context, produce many possible output phonemes at the sa...

Phonology of Exceptions For Korean Grapheme-to-Phoneme Conversion

Being an essential part of a Korean speech recognition system and a Text-To-Speech (TTS) system, a Korean Grapheme-to-Phoneme conversion system is generally composed of a set of regular rules and an exceptions dictionary [1, 2, 3]. The exceptions have been recorded in the dictionary in a simple and random manner, whereas the researches on the regular rules have been actively progressed. This pap...

Unlimited Vocabulary Grapheme to Phoneme Conversion with Probabilistic Phrase Break Detection

This paper describes a grapheme-to-phoneme conversion method using phoneme connectivity and CCV conversion rules with probabilistic phrase break detection. The method consists of mainly four modules including phrase-break detection, morpheme normalization, morpheme to phoneme conversion and phoneme connectivity check. In the experiments with a test corpus of 210 sentences, we achieved 85% of p...

Publication date: 2006